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Abstract 



In this paper wc show that if A is a subset of the primes with positive relative density 
6, then A + A must have positive upper density Ci(5e-^^(i°s(^/"'»''''(i°s'°s(i/'5))'^' in the 
natural numbers. Our argument uses the techniques developed by Green and Green- Tao 
in their work on arithmetic progressions in the primes, in combination with a result on 
' sums of subsets of the multiplicative subgroup of the integers modulo m. 

"t^ '. 1 Introduction 

In recent years there has been much progress made toward understanding additive properties 
of the primes. One of the first important structural results on the primes is due to Van 
(N ■ der Corput 0], who showed that the primes contain infinitely many three term arithmetic 

progressions. More recently, Green [l| proved a version of Roth's theorem, by showing the 
existence of three term arithmetic progressions in subsets of the primes which have positive 
0^ ■ relative density. In 2004 Green and Tao proved the celebrated theorem that the primes 

contain arbitrarily long arithmetic progressions. 

The strategy developed by Green and Green- Tao is to embed the primes in a 'random' 
Q>^ I set where they have positive relative density and to apply a relative version of Szemeredi's 

■ theorem which holds in this setting. Extending results from additive number theory to 

the setting of random sets with asymptotic density was first considered by Kohayakawa- 
Luczak-Rodl [sl who proved a variant of Roth's theorem. An alternate proof of this version 
of Roth's theorem is proved in Tao-Vu 01 and lends itself to adaptation in the primes 
(similar to Green's proof of Roth's theorem in the primes and used recently in 0]). This 
method of embedding the primes in a 'random' set suggests that one should be able to prove 
other results which are known in a random setting to that of the primes. A result of Laba 
and the second author 0] says that if ^ is a subset of a random set in Zjv := Z/A^Z with 
positive relative density, then A + A must have positive density in Z^v- 

Theorem 1. Suppose that S is a random subset o/Zjv such that the events x ^ S, where 
X ranges over Z^r, are independent and have probability p = p{N) G (CA^~^,1], where 
< 9 < 1/140. Then for every (3 < a, the statement 
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for every set A C S with \A\ > a\S\, we have \A + A\ > /3N 

is true with probability 1 — Oq^^(1) as N ^ oo. 

The main result of this paper is a version of Theorem [1] where A is replaced by a relatively 
dense subset of the primes, which take the role of the random set S. We should note that 
it is known that if V is the set of primes then the density of "P + "P in the natural numbers 
is 1/2 (see, for example, [lot). 

Theorem 2. Let A be a subset of the primes with positive relative density < 6q < 1. 
Then there exist absolute constants Ci and C2 such that A + A has positive upper density 
at least 

Ci<5oe-^2(^°s{l/5o))'/'{loglog(l/<5o))^/^ (1) 

in the natural numbers. 



Remark 3. For 5o = 1, a modification of our argument can be used to show that A + A has 
upper density 1/2 (in particular, one can use the Chinese remainder theorem to prove a 
corresponding version of Theorem |4] below) . For sufficiently small values of 6q, the constant 
Ci can be absorbed into the exponential term. For larger values of 60 we can use the 
boundedness of the argument of the exponential near 5q = 1 to rewrite the density as CiSq 
for a new value of Ci . 

While we do not believe that this bound is best possible, the following example shows 
that it is not possible to extend Theorem [2] to the the analogous conclusion of Theorem [TJ 

Let if denote the Euler totient function, that is, for an integer n, ip{n) is the number 
of integers less than n which are relatively prime to n. Let m be the product of the first t 
primes and define 

A := {p : p= l(mod m)}. 

Let An denote the set of elements of A which are less than or equal to n. The prime number 
theorem for arithmetic progressions implies that 

l^nl = — + o • 
ip{m)iogn log n 

Hence, if n is sufficiently large, we have 

Kl > ^ 

2ip{m) log n 

It follows by the prime number theorem that the relative density of A in the set of primes 
is at least 5 := l/2(^(m). On the other hand, the definition of A implies that 

An + An <Z {s : s = 2(mod m)]. 

Using estimates of Chebyshev and Mertens (for example, see Theorem 8.2 and Theorem 8.8 
in [^) we see that m ~ (/3(m) log log (/^(m), and hence 

, , 2n 5 

\An +An\< — T-rrrn. 

m loglog(l/oj 
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We also remark that it is possible to replace the bound in ([T|) with the weaker bound of 
(5^ using a much simpler argument which uses Cauchy-Schwarz and the Brun sieve. 

One important difference between proving Theorem [1] and Theorem [2] is that while S 
is defined to be a random set in Theorem [H the set of primes is not randomly distributed. 
For example, there is only one prime which is divisible by 2, and if x 7^ 2 is prime then the 
probability that x + 1 is prime is zero. The strategy employed by Green and Green- Tao 
to handle this difficulty is to consider the primes modulo m where m is the product of 
small primes. They then pick one residue class where A C V has large density and find an 
arithmetic progression contained in that residue class. 

In order to bound the density oi A + A we are not able to restrict the arguments to 
one residue class. To prove Theorem [2] we will need a way to consider all residue classes 
for which A has large relative density. The relevant residue classes are contained in the 
multiplicative subgroup of the integers modulo N, which we will denote by Z^. The result 
that we need is contained in the proof of the following theorem. 

Theorem 4. Let < a < 1. Assume that m G Z"*" is sufficiently large depending on a. If 
C N satisfies \B\ > aip{m) then there are absolute constants Ci and C2 such that 

\B + B\> Ciae-^^(i°s(i/"))'''(i°^i°s(^/"))'''m. 

It is not a coincidence that the conclusions of Theorems [2] and H] contain the same 
exponential factor. In fact, it will be evident in the proof of Theorem [2] that this factor 
comes about entirely from the structure of for a suitable modulus m. Furthermore, the 
following example, in conjunction with Freiman's theorem (see, for example. Theorem 5.33 
in [3]), suggests that the l/loglog(l/(5) factor obtained in the previous construction may 
in fact be sharp: 

Suppose m = Pi ■ ■ ■ Ps, where pi < ■ ■ ■ < Ps are the first s primes. Tlien ^772 — ^pj^ x 
• • • X Zp^. Let B C l^m be given in this representation by 

B := {1} X . . . X {1} X (Zp,^, \ {0}) X ... X (Zp, \ {0}). 

Here B is a low dimensional generalized arithmetic progression. Then, using the notation 
of Theorem HI we can directly compute 

a = l/ip{pi . ..pt) 

from the fact that ip{p) = p — 1 for a prime p and the fact that ip is multiplicative. In 
addition, computing B + B and using the same estimates as in the previous example, we 
obtain 

, „ „, inn a 

\B + B\ = r^^TTT"^- 

Pi...pt loglog(l/a) 

2 Preliminaries and an outline of the argument 

Throughout this paper A will be a subset of the primes. An will be a subset of the primes 
which are less than or equal to n and An^ will be those elements of An which are congruent 
to b modulo m, where m is the product of the primes less than or equal to some sufficiently 
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large parameter W. We use |^| to denote the cardinality of the set A and define A + A := 
{a + a' : a, a' € A}. We write C, Ci, or C2 to denote an absolute constant, although 
the exact value of any of these may differ between any two different expressions. For real- 
valued functions / and g, we write f <^ g to mean |/| < C\g\. As previously noted, we 
write to denote the cyclic group Z/NZ and := {x G Z^v : {x, N) = 1} to denote the 
multiplicative subgroup of integers modulo N. 

If / : Zat — 7> C then we define the expectation of / to be 

We define the normalized Fourier transform 

where e{a) := exp(27ria). For two functions f,g : 'Ljq — t- C we define the convolution 

f*g{x) := ^ f{y)g{x-y). 

We also define the U' norm 

ll/llp:=( E \f(-)\'f" 

and the L°° norm 

ll/lloo := sup \f{x)\. 
We will use Plancherel's identity which says that 

E mm = N^' E /(^)^' 

an identity for convolution 

and the Fourier inversion formula 

/(^) = E fiOe{xC/N). 

We will say that a function / : Zjv — )■ R>o is pseudorandom if 

11/(0 - M=o\\oo < V 

for some < rj < 1. 

An outline of our argument is as follows: In Section 3, we begin by partitioning A 
into residue classes modulo m, where m is the product of small primes. We then use 
techniques introduced in [l| to embed each residue class on which A is concentrated into 
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Zjv for N ^ n/m. In this setting, we are able to utilize the concept of pseudorandomness 
to decompose a modified characteristic function of A on each partition (simultaneously) 
into a bounded part and a linearly uniform part. Modifying the arguments used to prove 
Theorem [U we show that the sumset of the images of any two congruence classes of A has 
comparably large density in Zjv. 

In Section 4, we develop a moment estimate needed to prove Theorem HI In particular, 
in Proposition [T^ we prove a A;th moment estimate of the representation function which 
bounds the number of ways to write an element of as the sum of two elements in B. 
An application of Holder's inequality allows us to use the kth moment estimate to prove 
Theorem |4l 

In Section 5, we combine the results of Sections 3 and 4 with an application of Holder's 
inequality to complete the proof of a finite version of Theorem [2l 

3 Sumsets and uniformity of the primes in residue classes 

The main goal of this section will be to prove Proposition [6] below. Before stating Proposi- 
tion [6] we will state a finite version of Theorem [2] which will allow us to introduce necessary 
notation. 

Let < 5o < 1. Let ^4 be a subset of the primes with positive relative density Jq- This 
means that 

hmsup — — — = do, 

n— i>oo I ' n I 

and hence there exist infinitely many n so that ^^5^ > 5o/2. Theorem E] will then follow 
from a finite version, where 6 := 6q/2: 

Theorem 5. Let An C Vn satisfy \An\ > ^IP^I. Then there exist absolute constants Ci 
and C2 such that if n > no(5) then 

Let e > be a small parameter and let W be sufficiently large depending on 5 and e, 
and satisfying W <^ log log n. Set 

m = JJ p. 

p<W 

We begin by partitioning An into congruence classes modulo m. More specifically, let 

A^f) = {a e An : a = b (mod m)} , 

and 

= g 77^ : p = 5 (mod m)} . 

Define 

lA^^^l 

\l n 
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For those sets An'^ for which we have a large relative density in Vn, we say that b is 
good, and we define the set of good residue classes to be 

G = {b£Z*^:Sb> S/2}. 

Combining the methods of Green [l| on three term arithmetic progressions in subsets 
of the primes with the methods used to prove Theorem [1] we are able to show that for any 
pair of good residue classes bi and 62 the sumset A^^^ + A^'^^ is dense in the progression 
{0 < X < 2n : X = 61 + 62 (mod m)}. Summing over pairs of residue classes and being 
careful not to count multiplicities, we have the following: 

Proposition 6. For every x ^ G + G, let 

A,= max (^Jl±^\>5/2. 



(b,b')eGxG \ 2 
b+b' = 



Then for every e > 



|^n + ^n|> V (A^-e)-. (2) 
^-^ m 

xdG+G 

Remark 7. As we will see in the proof of Lemma [TTl we will require m to be a rapidly 
increasing function as 5 and e go to 0. This is the reason that we must gain control on 
|G + Gj in order to prove Theorem [5j 

In the remainder of this section, we will prove Proposition [6l This requires several 
lemmas. 

The first lemma, which was proved by Green Lemma 6.1), allows us to consider An^ 
as a subset of Zjv for ~ n/m. For those b for which the density 5h is large, we will then 
construct a collection of simultaneously pseudorandom measures which majorize modified 
characteristic functions of the images in Z^r of the sets A^^ . Working in Z^v we then use 



Fourier analytic techniques. As our notation differs slightly from that used by Green in 
we include a proof for completeness. 

We recall Green's modified von Mangoldt function \h^m,N ■ '^'^ ~^ defined as 

^ — f '^^TW ^og{mx + 5) if X < A and mx + 6 is prime, 

\ otherwise. 

Lemma 8. Let N G {2n/m,4n/m], and let A^^^ = {m~^{An - b)) n {1, . . . , A} /or 6 G G. 
Then 



h,m,N{x) > r^- 



4(6) 

Proof. By the prime number theorem we have 
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^ logX > ^ 1^(6) (x) log X 



> log(n3/^) 



> 



3 6bn 3 
A(p{m) 4 



for n sufficiently large. Performing a change of variables x — )• + 6 we obtain 

5bn 



Y] I M){x)log{mx + b) > 

x<N 
mx-\-b prime 



By definition of N, 



h,m,N{x) > 6bn/4.mN > 4/16. □ 



Taking C Zj^ we notice that \A^^\ = \An^\ and 



> |^(Ji)+^(j2)| (3) 



for any 6i, 62 G. 
We now define 

and 



From the above lemma, we note that E/^^^ > ^b/lG. Such functions were first defined 
by Green in [l| where a three term arithmetic progression was located in a fixed residue 
class b where A^^ had large relative density. The function / is defined so that it has large 
expectation and u is pseudorandom. We require the following two lemmas of Green ([l|], 
Lemma 6.2 and Lemma 6.6) which express the pseudorandom properties of primes: 

Lemma 9. For N and W sufficiently large there is some D > such that 

^HO) < l + 0{{logN)'^) 

and 

sup|3(OI < 21oglogVF/VF. 
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Lemma 10. Let s > 2. Then there is a constant C{s) such that 

\\ms<c{s). (4) 

In order to prove Proposition [6] we will show that if fei , 62 G G then 

\{x e : (/(^^) * > 0}| > (^^L±i^ - e)N. (5) 

bmce 

fibi) * /(f'2) is supported on A^^'^ + ^SJ'\ assuming ^ implies that the size of this 
sumset must be large. In this case, by ([3]), we must have 

Noticing that An^^ + An'^^ is disjoint from An^^ + An"^^ provided 61 + ^2 7^ ^3 + ^4 proves 
Proposition [6j It is therefore sufficient to prove ([5]). 

Equation ([5]) follows, with modifications, from the arguments in [4[. These arguments 
rely on a Fourier-analytic decomposition of Green [l[ and Green- Tao §], which as stated, 
appears in 0] [see Proposition 5.1] and is also contained in 0| [see Theorem 10.20]. In 
particular the functions f^^^ are decomposed as f[^^ + where is bounded and f2'' 
is unbounded but 'uniform'. 

Lemma 11. Suppose that f = f^^^ and v = u^^'^ are as above. Let s > 2 and let cq > 0. 
Define 

fi{x) := + yi- ^2) : Vi, 2/2 G Bq), 

where 

Bo := {x : |e-2™«/^ - 1| < eo for ah ^ € Ao}, Ao := : 1/(01 > eo}- 

Define f2{x) := f{x) — fi{x). Then for every cr > 0, assuming that N is sufficiently large, 
and W is sufficiently large depending on eo, we have 

< /i < 1 + a, 

(ii) E/i = E/, 

(Hi) II/2II00 < eo/16 and ||/i||oo < 1, 
(iv) ||7i||, «1 /or i = 1,2. 

Proof. The proofs of (ii), (iii) and (iv) follow as in Q], while we reiterate the proof of (i) 
here. 
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In order to bound fi we begin by using Fourier inversion to show that 



< fi{x) 



1^ Yl fi^ + yi-y^) 

- JW^ Y v{x + yi-y2) 



^ E 2:-(0e[ ^ J 



y&Bo 

2 



E e(-ey/iv) 

J/GBo 



We continue by applying Lemma [9] and Plancherel's identity to show that 

2 

2 log log 1 



< Ro)i+ E 



Y.e{-iy/N) 

yeBo 



< l + 0((logAf)-^) + 



2 log log 



W LBn 



E 



SGZjv 



1 + 



2 log log iV 



\Bo\ 

Using the pigeonhole principle, there is a constant c > so that 

l^ol > (ceo)l^°liV. 

Also, 

Ei/(6r+eg|Aoi<c(sr 

where C(s) is given by Lemma \TU[ so 



Therefore 



0</i(s)<l + O 



|Ao| < iCis)/eor. 

( log log W 



\W{ceQ)iC{s)/eo) 

The bound now follows since W is sufficiently large in terms of eo- 
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Remark 12. In the following we will see that we must take eg to be smaller than 6^e^. 
Combining this with the fact that we need the error term in the final equation of the 
above proof to be bounded, we see that m must increase rapidly as 5 and e approach 0, as 
mentioned in the remark following Proposition [H 

Lemma 13. Suppose that f,g : Zjv — C are functions so that 

E(/) = a, 
E{g) = (3 

and which have the property that they are majorized (respectively) by pseudorandom func- 
tions v, fi : "Zn C, that is, 

< f{x) < u{x) 

and 

< g{x) < n{x) 
for every x € Z^v- Then for every e > 

|{x e : (/ * g){x) > 0}j > - e)N. (6) 

Proof. Without loss of generality, assume that < a < /? and let cr be a parameter which 
satisfies < o" < e/10. We decompose / = /i + /2 and 5 = gi + 52 as in Lemma \TT\ with 
eo depending on a and a to be chosen later. 

In order to establish ([6]), it suffices to prove a main term estimate 

|{x eZN-.fi* gi{x) > (jaN}\ > - 3a)N (7) 

and three error terms of the form 

\{x G : \fi * gj{x)\ > ^N}\ < aN, (8) 

where (i, j) / (1, 1). 

For the main term, we first notice that since / and g are both nonnegative 

ll/i*5i||i = ll/i 111 11^1 111 = a/3iV^ (9) 
If d?]) were false, then we would have 

li/i * 51 111 < aaN'^ + ail + a){^^^ - 3a)N'^ < a(3N'^ - aaN^ 

which contradicts ([9]). 

For the error terms, we will show the argument for j = 2 (the other estimate follows 
similarly). It is sufficient to show that 



\h*92M<<rN{^N']. (10) 
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Using the convolution identity for Fourier transforms and the Cauchy-Schwarz inequahty, 
we have 

ll/.*52||i= Yl \(f^*92){x)\'' 

= N'Y1 \M)\'\92{0\' 

_ Ar3||?||l/2||-||l/2||7||3/2||^l|3/2 



Hence, to ensure ([TO]) we simply require Eq^^ < a^a^ /200, or equivalently eo < D 



m 



4 Sumsets of Positive Density Subsets of Z 

Let m G 7^^. For B C and x G Z^, denote 

r^ix) = \{{b, b') £ B X B -.b + b' = x}\. 
In this section our main objective is to prove the following: 

Proposition 14. Let a > 0. Suppose m G Tj'^ is squarefree, and let B C ZJ^ satisfy 
\B\ > aip{m). Then there exists an absolute constant C such that if m > niQ = mo(a) and 
k€Z+ then 



As a corollary, we obtain Theorem |4] using Holder's inequality. 

Proof of Theorem [7j Assume that is a set which satisfies the hypotheses of Theorem [H 
We first assume that m is square free and at the end of the proof, we reduce the general 
case to the square free one. 

Using Holder's inequality, for any k G Z+, we have 

Y,rBix)= rB{x)<iYrBix)'] E M 
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The sum on the left is just \B\'^. Hence, Proposition 1141 impHes that 

|5|2fc/(fc-l) 



\B + B\ > 



i/(fc-i) 



l-BP*'^^'' / 2/(fc-l)^-2C(fc-l)2log(fe-l) 



- (|s|^(m))^v(fc"i) ^« ^ ; 

for k > 2. Taking k = [( Jggff,) )'^'] we find 

|5 + S| > ae-^2(log(l/a))2/3(loglog(l/a))l/3^ 

for a sufficiently small. 

To deal with the case when a is not small, suppose is the largest density for which 
we know the theorem is true. Partition B into Bi, . . . , B^a/ao\ sets each of which contains 
exactly aoip{m) consecutive elements of B. We apply the known result to each set Bj for 
1 < J < [a/aoj to obtain 

\Bj+B,\ > «oe-C^2(log(l/ao))2/3(loglog(l/ao))l/^^. 

Summing over all j, we have 

[o/ooj 

\B + B\> \Bj + Bj\ 

[a/aoJ 

> Q,Qe-C2(l°S(l/"o))2/=*(loglog(l/ao))i/3^ 



> C'iae"^2(^°s(i/"))'/^(iogiog(i/a))V3 



m 



as desired. Note that the constant Ci = e-C2(i°g(V«o))=/3(iogiog(i/ao))i/3 where oq is the 
largest a for which we know the result is true. This completes the proof when m is square- 
free. 

We reduce the general case to the squarefree one by letting 

mi = JJp, 

p\m 

and considering the intervals /j = [jnii, {j + l)mi) for j = 0, . . . ,m/mi — l. Let Bj = BDlj, 
denote aj = and let J = {j : aj > q/2}. Notice that 

m/mi—l 

Em 
. , ^ mi 
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and so 

Ea m 
a,, > . 
^ - 2 mi 

Considering Ij and Bj as subsets of integers, we note that (Bj + Bj) {Bi + Bi) = for 
i j- For each j S J we apply the theorem to the translate Bj — jrrii to get 

Therefore, as subsets of integers, 

> C7iaje-^2(log{l/a,))2/3(loglog(l/a,))l/3^^ 

> C7ie-^2{log(l/a))2/3{loglog(l/a))V3^^ 

> Ciae"^2(log(l/a))^/='{loglog(l/a))i/3^ 

as desired. □ 



Proof of Proposition 14 '■ Let i? be a set satisfying the hypotheses of the lemma, and 
suppose that the prime factorization of m is m = pi . . .p^. Define 

R{x) :=|{(6, r)(^BxZ*^:h + r = x}\ 

= \{b^B:h^x (mod pi), . . . , 6 ^ X (modp()}|. 

Then R{x) is simply counting the representations of x as a sum involving an element in 
B and an element taken from the whole of ZJ^, and in particular R{x) > rsix). Although 
R{x) is larger than the function rsix), it is easier to control. Our goal is to produce a good 
upper bound on the kth. moment of R{x). Define 

S:=Y^ R{x)K 

We begin by separating the values of x into partitions based on the value of (x,m). 
More specifically, for d\m let 

Xd ■.={x € [0, ?n — 1] : (x, m) = d] 

={x € [0, m — 1] : X = for some / E [0, m/(i — 1] with (Z, m/d) = 1}. 

Then we have 

5" = ^ ^ R{xf = ^ ^ \{heB -.h^x (mod p) for aU p\m/d}\'' 

d\iri xdX^ d\m xdX^ 

since the conditions b ^ x (mod p) for p\d are certainly satisfied by every 6 G by the 
condition {b,m) = 1. 
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Now, we denote the inner sum by Sd, so that 

d\m 

Expanding the kth. power in 5,^ and rearranging the order of summation gives 

fei,...,f)feG-B ^ex^ 

b-^ , . . . ,bi,^x (mod p) for all p\m/d 

Now, fix a fe-tuple 61, . . . , 6^ € i?. For this /c-tuple, the contribution of the inner sum above 
is 

\X(i n {x G [0,m — 1] : X ^ bi, . . . ,bk (mod p) for all p\m/d}\ (11) 
which is the same as 

|{Z G [0, m/d — 1] : (/, m/d) = 1 and Z ^ 61, . . . , 6fc (mod p) for all p\m/d}\ 

= \{l £ [0,m/d - 1] : I ^0,bi, . . . ,bk (mod p) for all p\m/d}\. 

Defining 

rp{bi, . . . ,bk) ■= \{s G [0,p — 1] : bi = s (mod p) for some i = 1, . . . , k}\ 
we see that estimating (fTT]) is equivalent to estimating 

p\m/d p\rn/d 

Hence, we have shown that 

bi,...,bkeB p\m/d ^ y J 

In order to bound this sum from above we need to understand the function rp{bi, . . . , bk)- 
We notice that if p is much larger than k, then a random fc-tuple will intersect k distinct 
residue classes (mod p) with high probability, and so rp{bi, . . . ,bk) is typically of size k. 
The following lemma quantifies this fact. 

Lemma 15. For bi, . . . ,bk £ B let 

f{bu...M)= E I- 

rp(6i,...,6fe)<fc-l 

Then there is an absolute constant c > such that for every /? G we have 
\{bi, ...,bkeB: f{bi, ...,bk)>P}\< k'2-'^PWck^^\Bf-2^^^^2_ 
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Proof. The result will follow from optimizing a double-counting argument on the quantity 



E 



E - 



over positive integer values of I. 

Upper Bound: We expand out the exponent and rearrange summation to get 

1 



E 



Pi,---,Pl\m 



Pi • --Pl 



E 



1. 



6;=bj (mod lcm(pi, . . . ,p,)) 



Now, fixing the ^-tuple pi, . . . ,pi, we suppose that the distinct primes among this Z-tuple 
are pi, . . . ,pu. Then the inner sum above is bounded above by 



E 



1 



ip{rny 



(/9(lcm(pi, . . . ,pi)) 
It follows that the original quantity is bounded from above by 



x = y (mod lcm(pi, . . . , pi)) 



Pl,--- ,Pi|m 



Pl. . .pi(p{lcm{pi,. . . ,pi)) 



Splitting the remaining sum based on whether p is greater than or less than f and analyzing 
appropriately we see that the inner sum over p|m is smaller than 



Ej+E;^ « + 

< log(0- 



dx 



2<p<l'- 



n>l' 



.1+1/1 



(13) 



It follows that 



E 

bi,bj£B 

where c > is some constant. 
Lower Bound: Let 

K = K{j3) = < 



E - 

^ p 

\ p\-m. / 



< (^(m)2(clog(/))' 



bi, . . . ,bk € B : 



E 



rp{b^,...,bf.)<k-l 



P 



15 



Given bi, . . . ,bk € -ftT, we have 



> 



1 



P p 

i,j — l,...,k p\m p\m 

ij^j p\b.i — bj p\b^ — bj for some i^j 



E 



> |9 



p| m 

rp(6i,...,(,fe)<fe-l 



SO there must be some pair, bi, bj, such that 



Ei 



> 



At least one such pair comes from each /c-tuple bi, . . . ,bk G i^, and a given pair 6^, 6j can 
appear in at most k'^\B\''~^ A;-tuples. We therefore have 



E 

bi,bj£B 



E 



1 i-i-i 'J I 



P 



> 



\K\ 



Combining the upper and lower bounds gives 



It follows that 

Taking 

we find that 



\K\ < 



I 

2 



k^c'{logiy\Bf-^ip{m)\ 



I = exp(/3/cA;2) 



K\ < k^2-^\B\^-'^ip(mY. □ 



We are now ready to proceed with the proof of Proposition [TH We would like to break 
the sum Sd into pieces defined by the behavior of the function rp{bi, . . . , bk)- For technical 
reasons, we must first define some additional notation. Define 



mi := p m2 := p, 

p\m p\m 
p<.3k p>3A; 

and for any d\m define 

:= p d2 := p. 

p\d p\d 
p<3k p>3fc 
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Then we can rewrite (1121) to have 



E ^ n (i-j)? n 

bi,...,bfcGB p\mi/di ^ P\m2/d2 ^ ^ 

. E f n A-fcM±i). 

bi,...,bfeeB ^ p|m2/<i2 ^ / 

For a fixed d and a, hi, ... ^h^ G -B, let 

P = {p|m2/d2 : rp(6i, . . . ,bk) < k - 1}. 
Expanding S'rf as a sum over geometric intervals we have 

j=-oo bi,...,bf.eB p\m2/d2 

/(6l,...,6fe)G[2J,2J + l) 

A: + 1\ TT / A; + 1\"^ 



E E i n 0-^)n(i 

j=-oo 61 6fcei3 ^ p|m2/d2 PG-P 

/((>l,...,6fc)G[2J,2i + l) 

Now, for fixed j and bi, . . . ,big £ B we have 



A; + 1 



< (fc + l)2^'+\ (14) 

where the second to last line follows from the fact that all primes in P are larger than 3 A;. 
Applying Lemma [T5] we now have 

00 

2(fc+l)2J' 



< .(..M)i n 0-^) E E 



e 



p|m2/d2 i=-oo 6i,...,6fees 

/{6i,...,i>fc)e[2i,2J+i) 

< A:V(miM)^|S|'=-VM' n ('l-^]f;e2(^+i)2.2-'=^P(2Vcfc^) 

= k^C^^{mildi)"-^\Bt-\{rnf TT f 1 - (15) 

p|m2/rf2 

where 

00 

Cfc = ^ g2(fc+i)2^-2-'=^P(2Vcfc^). (16) 
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Note that the lower bound on the range of summation in j comes from the fact that for 
every /c-tuple bi, . . . ,bk & B we have f{bi, . . . ,bk) > X]p<fc 1/p ^ log log k > 2 provided k 
is large enough. Note also that the sum is clearly convergent to some constant dependent 
only on k. 

We also note that since we will see that has size e^^ logfc^ ^j^g argument below we 
have absorbed several smaller functions of k into Ck- 

The remainder of the proof follows in two steps. We first find a bound for S in terms of 
Ck by summing J2d\m'^d and then we compute an upper bound for Ck- In particular, we 
will show 

S<^i^fff (17) 

and 

Cfe <e^'='i°sW. (18) 
We start by proving ()17p . Summing ()15p over all d\m, we have 

S<eCk\B\>^~Mmfy2^im,/d,)^ IT " ^ 

a\m P\m2/d2 

Noticing that ip(nii/di) < mi we have ip{mi/di)in2 < m. Further, we observe that the 
number of divisors d of m which will give the same d2 is the number of ways of choosing 
which primes p < 3k will be factors of d. In other words, there are fewer than 



'3k 



E 



I — ^^'^ 

t 



i=0 

such values of d. We can now rewrite the bound on S as 



d2\m2 p\m2/d 



iiA p ) J^.J^Y\p\d2iX-ik + ^)lp) 



p\m2 d2\m2 



= ci„-w™n(i-^)(i+ 

p\m.2 

< cia-w-n(i-^)' 



p — A: — 1 



(19) 



where the last step follows by recalling that primes dividing m2 are larger than k. Using 
the fact that \B\ = aip{m) and the fact that ip{m) = 'mY\p^^^{l — 1/p), the bound becomes 



S < 



Cl \B\^^{mf 



p\mi 
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The remaining product is less than {3k)^, which is smaller than Cfc. 
It remains to prove (llSp . Recall that 

oo 

Ck = Y^ g(fc+l)2^+l-exp(2Vcfc2)log(2) ^^20) 
j=0 

Expanding out the exponential function in the exponent, we see that the entire exponent 
is smaller than 

2(. + l)2^«-log(2)-|,-^. 

We notice that if j is larger than log2(c^A;^(/c + 1)), then 

2^^ ■ . 



2c2A;4 



Hence for j > log2(4c^fc^(A: + 1)), the exponent is smaller than log(2) — 2^ /ck'^ and so the 
tail of the sum is bounded by e^^^^^ \ Furthermore, we find that for small j, the exponent 
is maximized when 

2^ = ck^log{4ck^{k + l)/log2). 



Hence, splitting the sum in (j20p . we have 



Inequality (|T8|) follows. 
Combining (fTTl) and (fT8]l . we have 



^ — 2 k^l 

as desired. □ 



5 Completion of the Proof of Theorem [5] 

In this section we complete the proof of Theorem O which implies Theorem [21 the main 
result of this paper. 

Let n be sufficiently large, let An C Vn satisfy \An\ > and let e > 0. Suppose that 

G is as in Section 3, and let a be such that |G| = aip{m) (in particular, a > 5/2). Then, 
by Proposition El for every W sufficiently large in terms of 5 and e we have 

\An + An\> V (A, -e)- (21) 
^-^ m 

x&G+G 

where m = np<iy?'- Since 
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we can show that 



2' 



Hence, we also see that 



{b,b')eGxG 



Setting 

r(x) = b') £G xG :b + b' = x}\ 
we can see that ([22]) is equivalent to 



x€G+G 

where 

^ ^ (6,6'): 6+6'=x ^ ^ 

Using Holder's inequality, we have 



k / \ k 



\xeG+G / \xeG+G / 



Applying Proposition [T3] we find 



k-1 
k 



6 ( a^l^ 



2 I eCfc2iog{fc) 
Because k/{k — 1) > 1 and A^; < 1, we have 



E ^ ( 2 ) « ^ ^ ^-1 



(fc) 

m. 



x€G+G 

Proceeding as in the proof of Theorem S] and then returning to (j2ip , we have 

\An + > (ci5e"^^(i°s{i/5))^/^(iogiog(i/5))V3 _ ^ 

for every e > 0. 
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